Fuzzy relational thesauri in information retrieval: automatic knowledge base expansion by means of classified textual data

Authors

  • Domonkos Tikk
  • Jae Dong Yang
  • Péter Baranyi
  • Anikó Szakál
Abstract

In our ongoing project we are developing a tool that gives domain engineers a facility for creating fuzzy relational thesauri (FRT) describing subject domains. The resulting thesauri can serve as a knowledge base for an intelligent information agent answering user queries relevant to the described domains, or for textual searching on the web. However, creating (fuzzy) thesauri manually is quite a tedious process if the data source from which the domain engineer selects concepts and instances for the thesaurus is not well organized or structured, as is typically the case with textual databases. To ease the FRT creation process, we use a small starting FRT together with our text categorization technique, which temporarily expands the FRT during the supervised learning phase of text categorization. This by-product of categorization is then used to enlarge the final FRT automatically or semi-automatically.

Similar resources

Fuzzy relational thesauri expansion using categorized textual data

In our ongoing project we are developing a tool that gives domain engineers a facility for creating fuzzy relational thesauri (FRTi) describing subject domains. The created FRTi can be used as a knowledge base for an intelligent information agent when answering user queries relevant to the described domains, or for textual searching on the web. However, the manual creation of thesauri is quite ted...

Creation and Maintenance of Query Expansion Rules

In an information retrieval system, a thesaurus can be used for query expansion, i.e. adding words to queries in order to improve recall. We propose a semi-automatic and interactive approach for the creation and maintenance of domain-specific thesauri for query expansion. Domain-specific thesauri are especially required in highly technical domains where the use of general thesauri for query exp...

User Comprehension and Searching with Information Retrieval Thesauri

While information retrieval thesauri may improve search results, there is little research documenting whether general information system users employ these vocabulary tools. This article explores user comprehension and searching with thesauri. Data were gathered as part of a larger empirical query-expansion study involving the ProQuest Controlled Vocabulary. The results suggest that users’ kno...

Query Architecture Expansion in Web Using Fuzzy Multi Domain Ontology

As the web grows, establishing a general framework for data mining and retrieving structured data from the Web poses many challenges. Creating an ontology is a step towards solving this problem. The ontology raises the main entity and the concept of any data in data mining. In this paper, we tried to propose a method for applying the "meaning" of the search system, but the problem ...

Similarity Thesauri and Cross-Language Retrieval

This paper describes a method for constructing a thesaurus automatically from a corpus of suitable documents, using standard information retrieval methods. The resulting thesauri can be used for user-initiated query expansion, automatic query expansion, as well as cross-language retrieval. Researchers at the Swiss Federal Institute of Technology in Zürich developed and evaluated this method in ...

Publication date: 2002